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Abstract 


In this paper, we study the problem of reporting all maximal 
collinear subsets of a point set S in R? for d > 3. An algorithm 
for this problem can be used to detect if any three of the points are 
collinear or find the line that intersects the most points in S. Besides, 
obtaining such maximal subsets is necessary for some problems about 
the collinearity relation among points, such as when covering them 
with the fewest lines. We present practical algorithms to find all 
maximal collinear subsets of a set of n points, including one with space 
complexity O(n) and time complexity O(dn? log n), and one with space 
complexity O(n?) and time complexity O(d?n?). 

Keywords: Collinear points, Degeneracy testing, Maximal collinear 
subsets. 


1 Introduction 


This paper studies the problem of enumerating all maximal collinear subsets 
of a set of n points in R@ for d > 3. The problem is interesting in itself as 
an extension of the two dimensional case. Two interesting applications are 
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reconstructing paths of straight-moving objects using a sample of points on 
the paths and detecting lines in digital images [3]. 


There are moreover other problems that need to identify all maximal 
collinear subsets as a first step, such as the point line cover problem, which 
asks to cover a set of points with the minimum number of lines [14], and 
the general position subset selection problem (finding the largest subset of 
points in general position) [11]. Although these problems are usually stated 
for the plane, they can easily be extended to higher dimensions. 


In two dimensions, the optimal time complexity for enumerating collinear 
points can be obtained either by constructing and sweeping arrangements 
of lines in the dual plane (with space and time complexity O(n?) [8]) or 
by topological sweeping (sweeping arrangements without constructing them 
with space complexity O(n) [7]). In higher dimensions, however, duality 
and arrangements do not seem to be the best way of approaching this 
problem. When the dual of the point p = (pi, po2,...,pq) is defined as 
the hyperplane p*: rg = aay pix; — pa, the transformation is incidence 
preserving [6]. A set of hyperplanes in the dual space intersect at a common 
point if their corresponding points in the primal space lie on the same 
hyperplane. The technique used in two dimensions for finding collinear 
points (mapping input points to lines in the dual plane and sweeping their 
intersections) cannot be easily extended to higher dimensions to detect 
collinear points. Furthermore, it is not efficient, given that constructing 
arrangements for d > 3 has time complexity O(n®) [8]. Other duality relations 
(especially for 3-dimensional spaces) have been defined for some ray casting 
and stabbing problems (see for instance chapter 8 of [17]), in which the goal 
is to find a line that intersects the maximum number of objects. The time 
complexity of constructing arrangements of lines for these relations usually 
is O(n*) (due to the combinatorial complexity of the resulting arrangements). 
This has been improved using Pliicker coordinates [12] in some applications. 


The problem of deciding whether any three of a set of points are collinear 
in a plane (or deciding whether they are in General Position or Degenerate, 
considering the collinearity relation) is closely related to the problem of 
enumerating collinear points. The former is clearly a special case of the 
latter. An extension of the degeneracy testing problem to higher dimensions 
was studied by Edelsbrunner et al. [8], the goal of which is to determine 
whether any d+1 of n input points in R@ lie on the same hyperplane. It has 
been shown that at least Q(n%) sidedness queries are required to solve this 
problem [10, 9]. Given that k +1 points in a k-dimensional subspace of R@ 
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implies that d+ 1 points lie in a (d — 1)-dimensional subspace (for k < d), 
finding maximal collinear points (and thus the problem studied in this paper) 
in R@ is a special case of d-dimensional degeneracy testing. 

Since the worst case time complexity of enumerating maximal collinear 
points in R? is @(n?), the time complexity of any algorithm for this problem 
in R¢ is trivially Q(n?). The following simple algorithm does so in O(dn*). 
For each pair of points, check which of the other points are on the line 
containing the pair; report the line if there is at least one such point. By 
defining an ordering on the points and checking only the points that appear 
after the current pair, the algorithm can avoid reporting duplicate lines. 

In this paper we present algorithms for this problem. In Section 2 we 
assume a stronger model of computation than the real RAM and present 
a randomized algorithm for detecting collinear subset of points with time 
complexity O(n”). In Section 3, we present an algorithm on the real RAM 
with time complexity O(dn?logn) and space complexity O(n), the basic 
idea of which is the same as sorting the points radially around each point 
in two dimensions. To improve the time complexity of this algorithm, in 
Section 4 we present an algorithm whose central idea is combining collinearity 
relations of the input points in their projections onto several two-dimensional 
subspaces of R¢. We also improve this algorithm to achieve the complexity 
of the algorithm of Section 3. Namely, we present an algorithm with time 
complexity O(d?n”) and space complexity O(n”) (Section 4.6), an algorithm 
with time complexity O(d?n? log m) and space complexity O(d?n), where m 
is the maximum number of convex layers in the projections of the points 
(Section 4.3), and an algorithm with time complexity O(dn? log n) and space 
complexity O(n) (Section 4.4). Efficient implementations of these algorithms 
are possible, as we restrict ourselves to orthogonal projections onto the 
planes defined by the axes of the coordinate system. 


2 Projection to an Arbitrary Plane 


To improve on the simple algorithm mentioned in the introduction for enu- 
merating collinear points in higher dimensions, we can project the input 
points onto a plane and use an efficient two-dimensional enumeration algo- 
rithm to find collinear points of the projection. The main obstacle of course 
is that noncollinear points may become collinear when projected onto the 
new plane. Therefore, we need to find a plane, in which the projection of 
three points are collinear, if and only if they are collinear in the space. The 
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problem resembles finding “nice viewpoints” [19]. Here our definition of 
niceness differs though. For three dimensions, identifying planes that do not 
have this property is not difficult, as shown in Lemma 1. 


Lemma 1 Given three noncollinear points a, b, and c in R?, their orthogonal 
projections onto a plane P are collinear, if and only if P contains the cross 
product vector ab x ae. 

Proof: We assume that a, b, and c are distinct. For simplicity we move 
the origin of the coordinate system to point a and rotate it so that b is on 
the a axis and c is on the xy-plane. Clearly, in the orthogonal projection of 
these points onto any plane P, points a, b, and c are collinear if and only if 
vectors ab and @é are collinear. We show that a and @é are collinear if and 
only if P is orthogonal to the xy-plane (in other words, P contains the cross 
product of ab and ae). 


Obviously, if ab and aé are projected onto any plane orthogonal to 
the xy-plane, their projections are collinear (their projections lie on the 
intersection of that plane and the xy-plane). For the other direction, we show 
that if P is not orthogonal to the xy-plane, the projections of ab and aé are 
not collinear. Let P’ be the plane that contains ab and is orthogonal to P. 
Clearly, the projection of ab onto P lies on the intersection of P’ and P. 
For ab and Gé to be collinear on P, @ should also lie on P’. This, however, 
is impossible, since P’ is not parallel to the ry-plane and the intersection 
of P’ and the ry-plane is a single line, containing ab (note that ab and ae 
are not collinear on the xy-plane). 


Lemma 1 suggests Algorithm 2.1 to find a suitable plane for projection. 

Algorithm 2.1 finds a plane that does not contain the cross product 
containing any three points of the input. Therefore, based on Lemma 1 
the projections of three points are collinear only if they are so in the space. 
Given that the size of V is O(n3), the time complexity of Algorithm 2.1 
is Q(n3), yielding no improvement compared to the naive algorithm presented 
in the Introduction. The algorithm, however, can be randomized as follows. 
Modify step 2 of the algorithm to select the vector v randomly and step 3 
to select the plane randomly. Given that in both steps there are a countable 
number of discrete values that the real random variable should avoid, the 
probability of failure of this Monte Carlo algorithm is zero and its expected 
time complexity is O(n”) (O(n) for computing the projections and O(n?) 
for the two-dimensional enumeration algorithm). 
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Algorithm 2.1: EnumerateCollinearPoints8D-Projecting(S) 


1 Let V be the set of cross products ab x bc for every three points a, 0, 
and cin S. 


2 Choose a vector v, whose direction is different from every vector in V. 
To do that, let w be a vector in V, which is closest to the y axis 
(but not on the axis itself). Define v as a vector between w and 
the y axis. 


3 Choose a plane P containing the origin and v, but no vector in V. 
Note that for any w in V, the vectors v and w define exactly one 
plane that contains them and the origin. 


4 Project all points in S to P and enumerate the collinear points in P 
using a 2-dimensional enumeration algorithm. 


It should be possible to extend this randomized algorithm to higher 
dimensions, relying on a result similar to Johnson-Lindenstrauss’s lemma [13] 
for random projection, which states that a set of points in a high-dimensional 
space can be projected onto a low-dimensional space, while preserving the 
distance between them (here, the collinearity relation between points should 
be preserved). However, this algorithm depends on a model of computation 
stronger than the real RAM, a model in which it is possible to access a 
stream of random bits. There is, furthermore, another disadvantage. Due 
to the its projection step, this algorithm cannot be entirely described using 
geometric predicates. The deterministic algorithms presented in the rest of 
this paper do not perform such projections and are thus more appealing in 
practice. 


3 Sorting the Points 


We can enumerate collinear subsets of a set of n point P on the plane using 
the following algorithm with time complexity O(n? logn). For each point p 
in P, sort other points radially around it. The members of every collinear 
subset of P that contain p appear contiguously in this sorted sequence. Thus, 
we can enumerate collinear subsets of P by checking contiguous elements 
after sorting for each point. In this section, we show how to extend this 
algorithm to higher dimensions. The main difficulty in extending this two 
dimensional algorithm is that there is no obvious way of sorting the points 
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radially in R¢. 
Definition 1 We define the normalized vector nv(p, q) of two points p and q 
in R¢ as follows: 


1 
dt — Pt 


(q—p), ifp#a 


0, ifp=4 
where t is the smallest index such that pe ¥ qe. 


The normalized vectors x = nv(p,q) and y = nv(p,r) for points p, q, 
and r, can be compared as follows: Let t be the smallest index such that 
x: # y,. If no such index exists, the vectors are equal. If x: < y;, then 
x <y. Otherwise, x > y. 


Lemma 2 The points p, q, andr are collinear if and only if 
nv(p,q) = nv(p,7r). 


Proof: Suppose nv(p,q) = nv(p,r). We seek to show that p, g, and r are 
collinear. Note that this is equivalent to saying that the vectors pj = q — p 
and pr =r —p are collinear. We show this by proving that one is a scalar 
multiple of the other. Formally, this can be stated as 


q—p=X(r—p) for some \ ER. 


Let ty and t, be the value of t in Definition 1 for nv(p,q) and nv(p,r), 
respectively. Also let Ag = q@ — pe and A, = ry — px. By the definition of 
normalized vectors, we have 


1 
nv(p, q) = >-(9— P) 
qd 
and 
nv(p,r) = —(r—p) 
PT me P); 
giving 


q-—p=Xr—p) 


Enumerating Collinear Points in Higher Dimensions 87 


where A = Ag/Ar. 

For the converse, suppose p, g, and r are collinear, and thus, 
q—p = X(r—p) for some A € R. Also, let tg, ty, Ag, and A; be de- 
fined as above. Since q—p = X(r — p), t, should be equal to t,. Let c= a 
Therefore, we have 


Cap Seo erp) 
This implies 
(4% — Pt) =c- > (Tt — Pt); 
Te — Pt 
which gives c = 1, and thus we have 
ia snp), 
which is equivalent to (q— p)/Aq = (r — p)/Ar- 


Algorithm 3.1: EnumerateCollinearPointsBySorting(S) 
1 Let the sequence o be an arbitrary ordering of S. foreach p in o do 


2 Let P=S \ {p}. 
3 Sort the points of P using nv(p,q) as key for each q € P. 


4 foreach maximal contiguous subsequence Q of the sorted sequence 
do 

5 if p appears before every entry of Q ino then 

6 | Report Q U {p}. 


Theorem 1 Algorithm 8.1 reports all maximal collinear subsets of a set S 
of n points in R¢ with time complexity O(dn?logn) and space 
complexity O(n). 


Proof: Let QU {p} be a maximal collinear subset of S, in which p 
appears first in o (Line 1). For every pair of entries q and o in Q, we 
have nv(p, q) = nv(p,o) (Lemma 2). Therefore, after sorting in the loop of 
Algorithm 3.1, all entries of Q appear contiguously and are reported in the 
last loop of the algorithm (Lines 5-7). 
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In each iteration of the algorithm, the time complexity of sorting 
is O(dnlogn) and the time complexity of the loop of Lines 5-7 is O(dn). 
Therefore, the total time complexity of the algorithm is O(dn? log n). Also, 
we can store the sorted list as pointers to the input list, which implies that 
the space complexity of the algorithm is O(n). 


4 Combining Collinearity Relations Across 
Projections 


Let S be the set of input points in R? and D be a set of planes. The points 
in a subset of S are collinear only if their projections to any plane in D are 
collinear as well. Inferring the converse of this statement (i.e. if a subset 
of S are collinear in R@, then their projections are collinear in all the planes 
in D) is possible if D is selected appropriately. In Section 4.1 we discuss this 
problem in more detail and use its results in the remainder of this section 
for enumerating collinear points in higher dimensions. 


4.1 Useful Lemmas about Collinearity and Projections 


Let Pj; be the plane containing the 7th and jth axes of the coordinate system 
in R?. For a vector v, we denote its ith component with v; and its projection 
to plane P,; with v7. Note that we have v! = (v;,v,;). We now present 
two lemmas about the collinearity of points and their projections. 


Lemma 3 Leta, b, andc be three points in R®. Let D contain (2) orthogonal 
planes Pj; for 1 < i,j < d andi < j. These points are collinear in the 
d-dimensional space if and only if they are collinear when projected onto 
each of these planes. 


Proof: Define v as the vector ae and u as ae. If points a, b, and c are 
collinear in R%, then either one of the vectors v and u is the zero vector 
(implying that at least two of the points are coincident and thus these 
three points are collinear in every projection) or they are linearly dependent 
(i.e. there exists a constant such that v = Au). If v = Au, we clearly have 
v) = \uJ, implying that the projections of the vectors v and u are also 
dependent and thus collinear. 

For the converse, we show that if the projections of these points to every 
plane P;; in D are collinear, they are also collinear in the d-dimensional 
space. If either v or u is a zero vector, at least two of these points are in the 
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same position in the space and the points are trivially collinear. Suppose not. 
There exists an index 7 such that both v; and u; are nonzero. Otherwise, 
let v, and u, be nonzero (since v and u are nonzero, indices x and y do 
exist). Assume without loss of generality that 2 < y holds (we can always 
relabel the points 6 and c for this to hold). By our assumption (that 7 does 
not exist), v, and u, are zero. In that case, these points cannot be collinear 
in plane P,,, contradicting our assumption about the nonexistence of 7. 
Suppose v; and u; are nonzero for some index i. Define \ as v;/uj. 
Since v7 and u” have the same direction for i < 7 < d, we have v! = \u) 
for every such j, implying that v; = Au,;. For 0 < j <7, we similarly have 
vj; = Au;, otherwise v)* and u* cannot be collinear in P;;. Therefore, we 
should also have v = Au, implying that v and u have the same direction 
and, therefore, a, b, and c are collinear in the space. 


The number of planes required to infer collinearity from projections 
can be significantly reduced by imposing some restriction on the points, as 
shown in Lemma 4. 


Lemma 4 Let a, b, and c be three points in R¢, and k be an integer such 
that l <k <d. Suppose a; = b; = cq for 1 <i<k, and az F by and az F Cx. 
Define D as the set of d—k orthogonal planes {Pxr41, Pre+2; ---, Pra}- 
These points are collinear in the d-dimensional space if and only if they are 
collinear when projected onto each of the planes in D. 


Proof: As in the proof of Lemma 3, define v as the vector ab and was ae. 


The first part of the proof is similar to Lemma 3 and will be omitted. 

For the other direction, we show that if the projections of these points 
onto all planes in D are collinear, they are also collinear in the d-dimensional 
space. Define A as vz/uz (note that by the definition of k, A is neither zero 
nor infinity). Since v* and u* for k+1<i<d have the same direction, 
we should have v** = \u** for every k+1<i<d. Therefore, we should 
also have v = Au, implying that v and u have the same direction and, 
thus, a, 6, and c are collinear. 


4.2 The First Algorithm: Combining Collinearity Across 
Projections 


In Algorithm 4.1 we use Lemma 3 to detect maximal collinear points in d 
dimensions. Its inputs are a set S of points and a set D of input planes onto 
which the points are projected. 
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Algorithm 4.1: EnumerateCollinearPoints-Combining(D, S) 

1 Initialize the set R to contain the single set S. During the algorithm R 
contains the set of maximal subsets of S that may be collinear. 

2 foreach plane P in D do 

3 Project all members of S onto plane P. 


4 Using a two-dimensional enumeration algorithm, obtain the set T 
of maximal collinear subsets of points in P. 

5 Split the subsets of R, so that every member of R is a subset 
of a member of T and the resulting members are maximal (the 
combination step). 


6 Report the sets in R with three or more members. 


Algorithm 4.1 maintains the set R to represent the maximal subsets of S 
that may be collinear. In each iteration of the loop, the set R of O(n?) subsets 
of S should be split based on the set T (also of size O(n?)). Maximality of 
the members of the resulting set means that if three points appear in the 
same set in both R and T, they should also appear together in a set of the 
result. Also, if three points do not appear in the same set of either R or T, 
they should not appear in the same set of the result. It can be easily shown 
that any two members of T have an intersection of size at most one (two 
lines can have at most one intersection). It can be inductively shown that 
this property also holds for R. In the proof of Theorem 2, we analyze the 
correctness and complexity of this algorithm. 


Theorem 2 When the set of planes D defined in Lemma 3 and a set 
of n points S in R® are specified as inputs, Algorithm 4.1 reports all 
maximal collinear subsets of S with time complexity O(d?n?) and space 
complexity O(n?). 


Proof: The combination step of the algorithm ensures that three points 
appear in the same member of R, unless they are not collinear in their 
projections onto any P in D. Based on Lemma 3, if a subsets of S$ is 
collinear in projections onto any plane in D, they are collinear in the d-space 
and therefore this algorithm reports all maximal collinear subsets of the 
points in S as required. 

Two-dimensional enumeration algorithms have time complexity 2(n7), 
due to the worst-case size of the output (the number of maximal collinear 
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subsets). In Section 4.6, we show how the combination step can be performed 
with time and space complexity O(n”). Therefore, Algorithm 4.1 has time 
complexity O(d?n”) and space complexity O(n”). 


If the first (or in general any other) coordinate of input points are 
distinct, we can obtain a more efficient algorithm by simply passing the set 
of planes specified in Lemma 4 to Algorithm 4.1 (Corollary 1). 


Corollary 1 Algorithm 4.1 reports all maximal collinear subsets of a set of 
points S with distinct first coordinates in R@ with the time complexity O(dn?) 
and space complexity O(n). 


4.3. The Second Algorithm: Combining Collinearity per Point 


To reduce the space complexity of Algorithm 4.1, not all of the maximal 
collinear subsets of S should be computed in each step of the enumeration 
algorithm. Instead, we can consider one of the input points at any time and 
identify maximal collinear subsets that contain it. Before proceeding, we 
define the p-collinearity relation. 


Definition 2 A set of points C are p-collinear for point p, if there is a 
line that contains all of the points in C U{p}. Also, two points q and o 
are p-collinear, if {q,0} is p-collinear. 


Theorem 3 shows that maximal p-collinear subsets of S can be obtained 
efficiently in a 2-dimensional plane [18]. 


Theorem 3 Given a set of points S in the plane and a point p, 
with O(nlogn) preprocessing a sequence s can be obtained with time com- 
plexity O(nlogm) and space complexity O(n), such that p-collinear subsets 
of S appear contiguously in s. Here m is the number of convex layers of S. 


Let s be the result of the algorithm described in Theorem 3 for point p 
and plane P. We can obtain maximal p-collinear subsets of points in S$ \ {p} 
by decomposing s into p-collinear subsets as follows. For each point in s, 
add it to the last subset in R if it is p-collinear with any of its members. 
Otherwise, append it to a new subset (note that some of these subsets may 
hold a single point). Since p-collinear points appear contiguously in s (as 
guaranteed by Theorem 3), the resulting subsets are maximal. We use this 
result in the following algorithm. Note that two maximal p-collinear subsets 
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of S \ {p} always have an empty intersection. This implies the sum of the 
sizes of the resulting p-collinear subsets is |S] — 1. 

We now present Algorithm 4.2, which is based on Theorem 3. It takes 
the set of input points S and a set of planes D onto which the points are 
projected. We analyze the time complexity of this algorithm in Theorem 4. 


Algorithm 4.2: EnumerateCollinearPoints-CombiningPerPoint(D, S) 


1 Let the sequence o be an arbitrary ordering of S. 


2 Perform the preprocessing required in Theorem 3 on the set of points 
for each plane in the set D. 


wo 


foreach p in o do 


4 Initialize the set R to contain the set S \ {p} as its only member. 
foreach plane P in D do 


5 Let T be the set of maximal p-collinear subsets of S \ {p}, 
when the points are projected onto P, obtained using the 
algorithm described in Theorem 3. 


6 Split the subsets of R, so that every member of R is a maximal 
subset of a member of T (the combination step). 


; For each member r in R, report r U {p} if r does not contain any 


point that appears before p in o. 


Theorem 4 When the set of planes D defined in Lemma 8 and a set of n 
points S in R@ are specified as inputs, Algorithm 4.2 reports all maximal 
collinear subsets of S with time complexity O(d?n?logm) and space com- 
plexity O(d?n), where m is the mazimum number of conver layers among 
projections of S onto any plane in D. 


Proof: The preprocessing in Algorithm 4.2 can be done with time 
complexity O(n log n) and space complexity O(n) per plane. Then, the set of 
maximal p-collinear subsets are combined for input planes (the logic is similar 
to Algorithm 4.1, except that here the subsets are p-collinear). In Section 4.5, 
we show how the combination step of Algorithm 4.2 can be performed with 
time complexity O(n). Finally, the sequence o (containing the members 
of S in an arbitrary order) is used to prevent reporting duplicate collinear 
subsets. Therefore, Algorithm 4.2 finds the maximal collinear subsets of S$ 
with time complexity O(d?n? log m) and space complexity O(d?n). 
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The space complexity improves over that of Algorithm 4.1 if d = o(./n), 
trading off an extra logm factor in the time complexity. For the general 
case, in which the first coordinate of the points are distinct, we can simply 
pass the set of planes defined in Lemma 4 to Algorithm 4.2 (Corollary 2). 


Corollary 2 Algorithm 4.2 can report all maximal collinear subsets of a set 
of points S with distinct first coordinates in R¢ with the time 
complexity O(dn? logm) and space complexity O(dn). 


4.4 The Third Algorithm: Decomposition Based on the Dis- 
tinctness of the Initial Coordinates 


In Algorithm 4.3, we decompose the set of points based on the value of their 
initial coordinates compared to those of the current point p. This allows us 
to use Lemma 4 instead of Lemma 3. 


Algorithm 4.3: EnumerateCollinearPoints-Decomposition(S) 


1 Let the sequence o be an arbitrary ordering of S. 

2 foreach p in o do 

3 Let C; for 1 <i <d+1 be a decomposition of S \ {p}, such that 
C; contains all points q with q; = p; for 1 < j <1 and either 
GAP ort=d+1. 

for i= 1 to ddo 

Initialize the set R to contain the set C; as its only member. 

for j =i+1toddo 


YO ap 


Let T be the set of maximal p-collinear subsets of C; when 
its points are projected onto P;;. This can be obtained 
by first sorting the members of C; radially around p. 


8 Split the members of R, so that each of them is a maximal 
subset of a member of T (the combination step). 


9 For each member r in R, report r U {p} if r does not contain 
any point that appears before p in o. 


10 Report Ca+1 U {p}, if it does not contain any point that appears 
before p in o. 


This also allows us to achieve the same time and space complexities as 
Theorem 1, as the next theorem shows. 
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Theorem 5 Algorithm 4.8 reports all maximal collinear subsets of a set S 
of n points in R¢ with the time complexity O(dn?logn) and_ space 
complexity O(n) (in addition to O(dn) for storing the input). 


Proof: For each point in the outer loop of Algorithm 4.3, S \ {p} is 
decomposed into C; for 1 <i<d-+1, such that the first 7 — 1 coordinates 
of the points in C; are equal to those of p (this can be performed with 
time complexity O(dn)). Lemma 4 implies that for computing all maximal 
p-collinear subsets of C;, we can consider only the planes P;; for i <7 <d 
(Lines 6-8). 

In each iteration of the outer loop of Algorithm 4.3 (Lines 2-10), S'\{p} is 
decomposed into sets C; for 1 <i < d+1 in O(dn). Also, using Algorithm 4.4 
for combining p-collinearity relations, the combining step (Line 8) can be done 
in O(|C;|). This implies that the total time complexity of Algorithm 4.3 is 


d 
n G +S0 (|C;| log |C;| +e) ; 
1 


which simplifies to O(dn? log n). The space complexity of every step of the 
algorithm is O(n). 


4.5 Combining Collinearity Relations for 
Algorithms 4.2 and 4.3 


Let R and T be two decompositions of S’, in which S’ = S' \ {p}. We 
obtain Q from R and T such that: i) Q is also a decomposition of 9’, ii) 
every member of @ is a subset of a member of R and also a member of T,, 
and iii) the members of Q are maximal in that if points g and o belong to 
the same members of R and S, they also appear in the same member of Q. 

Assign an index to each member of R and T from 1 to |R| and |T7’ 
respectively. Given that R and T are decompositions of S’, every point in S’ 
appears in exactly one member of R and T. Let R, for point q in S” be 
the index of the set in R containing q (define T, similarly). Based on the 
above definitions two points g and o should appear in the same set if and 
only if Rg = Ro and T, = T,. We therefore decompose 5S’ based on the value 
of R, and Tj for points like g. To do so efficiently, we use algorithm 4.4, 
which sorts the points based on the value of the tuple (Tj, Rg). 

After the first outer loop in Algorithm 4.4, LZ; contains all points like q 
such that Tj is 7. Due to the order of insertions, the entries in each L; are 
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Algorithm 4.4: Combine(R,T) 


a 


Q is the resulting set, initially empty. 


NO 


L; for 1 <i <n are empty lists. 

3 foreach member r of R in index order do 
4 foreach point q in r do 

5 | Append gq to the end of the list L7,. 


6 fori =1tondo 

7 foreach entry q in L; in the order of insertions do 

8 if Q is empty then 

9 Insert g into a new empty set. 
10 else 

11 Let o be any member of the last set in Q. If the value of 


Ry, = R, and T, = T, append q to the same set, otherwise 
insert q into a new empty set of Q. 


12 return Q. 


sorted based on the value of R,. Therefore in the second outer loop, the 
points are processed, ordered according to the value of their (Tj, R,) tuples. 
Algorithm 4.4 has time and space complexity O(n). 


4.6 Combining Collinearity Relations for Algorithm 4.1 


To achieve the time complexity O(d?n7) for Algorithm 4.1, its combination 
step should be performed in O(n”). Ideally we could have used Algorithm 4.4 
with time complexity O(m) for combining two decompositions of an m ele- 
ment set. Unfortunately, however, this is not possible, since in Algorithm 4.1 
the sets R and T are not decompositions of the set of input points and may 
contain O(n?) elements. In the rest of this section we present an O(n”) 
algorithm for the combination step of Algorithm 4.1. 

The main idea in Algorithm 4.5 is using Algorithm 4.4 after obtaining 
from sets R and T decompositions of an O(n?) element set. Then, we can use 
algorithm Algorithm 4.4 to combine them with the time complexity O(n7), 
and finally, convert back the result. 

Let A be the set of all pairs of points in the set of input points S. We 
use Algorithm 4.5 to obtain R’ and T’ from R and T respectively. We show 
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Algorithm 4.5: Convert(R) 

1 R’ is the resulting set, initially empty. 

2 foreach r in R do 

Let r’ be an empty set corresponding to r. 


foreach pair of points q and o in r do 


a - Ww 


| Append the pair (0, ¢q) to r’. 
6 Add r’ to R’. 


foreach pair of points (o,q) not appearing in any member of R' do 


“I 


8 Append a set to R’ containing only the pair (0, q). 


9 return R’. 


that R’ (and similarly 7”) is a partition of A. 


Lemma 5 Let S be a set of points and let A be the set of its unordered pairs 
{{a,b}:ae€ R,be S}. Given a partition R of S, Algorithm 4.5 returns a 
partition of A. 


Proof: Due to the condition in the second outer loop of Algorithm 4.5, 
every pair of input points appear in at least one member of R’. Therefore, 
it remains to show that the intersection of any two members of R’ is empty. 
Suppose this is not the case. Let (q, 0) be a pair of points that appear in two 
members of R’. Given that the last loop of Algorithm 4.5 does not introduce 
duplicate pairs, they must have been added to R’ in the first outer loop of 
the algorithm. Then, the members of R corresponding to the members of R’ 
that contain this pair have at least two points in their intersection (i.e., q 
and o), which contradicts the fact that R is a partition of S. 


After performing Algorithm 4.4 on the converted sets, we use Algo- 
rithm 4.6 to convert back the results. 

Since the time complexity of Algorithm 4.5 (to create the decomposi- 
tions), Algorithm 4.4 (to combine the sets), and Algorithm 4.6 (to convert 
back the result) is O(n”), the combination step of Algorithm 4.1 can be 
performed in O(n?). We now show that the resulting set is maximal. 


Lemma 6 Let R and T be partitions of a set of points, S. Define R' 
as Convert(R), T’ as Convert(T), Q’ as Combine(R’,T’), and Q as 
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Algorithm 4.6: ConvertBack(R’) 
1 F is the resulting set, initially empty. 
2 Remove elements of R’ whose size is 1 (i.e. contain a single pair). 


3 foreach r’ in R’ do 


4 Let r be an empty set corresponding to r’. 
5 foreach pair (o,q) in r’ do 
6 Append elements o and q to r. 


7 Add r to R. 


8 return R. 


ConvertBack(Q’). Three points a, b, and c appear in the some set of ¢ € Q, 
if and only if all of them appear in somer € Randt eT. 


Proof: If three points a, b, and c appear in r € R and t € T, every pair 
of them appear in r’ € R’ and t’ € T” (corresponding to r and t). Therefore, 
there is a member q’ of Q’ that contains all such pairs (due to the maximality 
of the subsets returned by Algorithm 4.4) and, thus, all three vertices appear 
in a set of Q (the member corresponding to q’). 

For the converse, suppose no element of R contains all three vertices a, b, 
and c. We show that no member of @ contains these three points either. 
Suppose a, b, and c belong to g € Q. Then, there is a member q’ of Q’, 
in which there are pairs containing each of a, b, and c (Algorithm 4.6), 
implying that these pairs also appear in r’ € R’ and t’ € T’ (as guaranteed 
by Algorithm 4.4). Due to the construction of R’ and T’ (Algorithm 4.5), 
all of these three points should appear in the corresponding members of R 
and T’,, contradicting our assumption. 


5 Concluding Remarks 


In half-space range searching, points inside query half-spaces, or equivalently 
on one side of query half-planes, are reported (for a survey, see [16] or [2]). 
Brnnimann et al. showed that under the arithmetic model of computation, 
an algorithm for half-space range searching with storage at most m, has 
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query complexity no better than 
_ (d~1) 
0 ((n/logny! eh fmt!) 


where d is the number of dimensions and n is the number of input points [4]. 
Despite the higher complexity of this problem and that this problem is usually 
studied in an online setting, some of the ideas employed by the algorithms 
presented for this problem may be applicable to the problem of enumerating 
collinear points. The main idea in many of the half-space range searching 
algorithms with almost linear storage is to recursively partition the points 
into many sets, each contained in a geometric object such that each query 
hyperplane intersects few of these objects. Only the points in intersecting 
objects are processed for each query (all of the points in the rest of the objects 
are either included or excluded in the result). A classic example is Matousek’s 
shallow partitions, which yields query complexity O(n!~!/l4/?! log? m+ k) 
with O(n log logn) storage and O(n logn) preprocessing [15]. For reporting 
collinear points we can use the same idea but instead of a hyperplane, the 
query remains a line in higher dimensions (actually every line defined by 
any pair of the points) and we are interested only in points on the query 
line. The time complexity of the resulting algorithm, however, does not seem 
promising, unless a better bound is obtained for the number of intersections 
between the objects and the query line. For two dimensions, Chazelle et 
al. presented an algorithm to achieve query complexity O(logn + k) with 
linear storage, in which & is the number of points in the result [5] (a similar 
result with slightly more storage is obtained for three dimensions by Afshani 
and Chan based on Matousek’s shallow partitions [1]). However, it may not 
be possible to extend the idea to the problem studied in this paper without 
increasing its query complexity. 
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